Simulation method and system

ABSTRACT

A simulation method and system. A computing system receives a first audio and/or video data stream. The first audio and/or video data stream includes data associated with a first person. The computing system monitors the first audio and/or video data stream. The computing system identifies emotional attributes comprised by the first audio and/or video data stream. The computing system generates a second audio and/or video data stream associated with the first audio and/or video data stream. The second audio and/or video data stream includes the data without the emotional attributes. The computing system stores the second audio and/or video data stream.

FIELD OF THE INVENTION

The present invention relates to a method and associated system for simulating emotional attributes within audio/video data streams.

BACKGROUND OF THE INVENTION

Modifying multiple data files typically comprises an inefficient process with little flexibility. Accordingly, there exists a need in the art to overcome the deficiencies and limitations described herein.

SUMMARY OF THE INVENTION

The present invention provides a simulation method, comprising:

receiving, by a computing system, a first audio data stream, wherein said first audio data stream comprises first speech data associated with a first person;

monitoring, by said computing system, said first audio data stream;

identifying, by said computing system in response to said monitoring said first audio data stream, first emotional attributes comprised by said first audio data stream;

generating, by said computing system in response to said identifying said first emotional attributes, a second audio data stream associated with said first audio data stream, wherein said second audio data stream comprises said first speech data, and wherein said second audio data stream does not comprise said first emotional attributes; and

storing, by said computing system, said second audio data stream.

The present invention provides a simulation method, comprising:

receiving, by a computing system, a first video data stream, wherein said first video data stream comprises first video data associated with a first person;

monitoring, by said computing system, said first video data stream;

identifying, by said computing system in response to said monitoring said first video data stream, first emotional attributes comprised by said first video data;

generating, by said computing system in response to said identifying said first emotional attributes, a second video data stream associated with said first video data stream, wherein said second video data stream comprises second video data associated with said first person, and wherein said second video data does not comprise said first emotional attributes; and

storing, by said computing system, said second video data stream.

The present invention advantageously provides a system and associated method capable of modifying multiple data files.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 illustrates a block diagram view of a system for simulating disabilities and/or enhancing audio/video data streams, in accordance with embodiments of the present invention.

FIG. 2 illustrates an algorithm describing a process used by the system of FIG. 1 for simulating disabilities and/or enhancing audio/video data streams, in accordance with embodiments of the present invention.

FIG. 3 illustrates a first internal block diagram view of the simulation/enhancement software application of FIG. 1, in accordance with embodiments of the present invention.

FIG. 4 is an implementation example illustrating the results of executing the emotion identification module and the emotion eraser module of FIG. 3, in accordance with embodiments of the present invention.

FIG. 5 is an implementation example illustrating the results of executing the focus/attention/transformer module, the focus identification module, and the focus eraser module of FIG. 3, in accordance with embodiments of the present invention.

FIG. 6 illustrates a second internal block diagram view of the simulation/enhancement software application of FIG. 1, in accordance with embodiments of the present invention.

FIG. 7 is an implementation example illustrating the results of executing the emotion enhancer module of FIG. 6, in accordance with embodiments of the present invention.

FIG. 8 illustrates a computer system used for simulating disabilities and/or enhancing audio/video data streams, in accordance with embodiments of the present invention.

DETAILED DESCRIPTION OF THE INVENTION

FIG. 1 illustrates a block diagram view of a system 2 for simulating disabilities and/or enhancing audio/video data streams, in accordance with embodiments of the present invention. System 2 performs a process for:

1. Simulating an environment that a disabled person (e.g., an autistic person) perceives. The simulated environment is used by a person without a disability (e.g., product designers, parents, teachers, managers, etc.) in order to increase awareness of and sensitivity to people having disabilities. System 2 monitors and modifies inputted audio/video media (e.g., an audio/video data stream associated with, inter alia, a television show, a movie, a live event (e.g., a concert, a live speaker, etc.), etc.). Audio and video media are independently modified to remove emotion cues and focus on given objects. System 2 monitors real-time events (e.g., movies, television shows, concerts, etc.) and presents them in a modified manner on a visual display or in a virtual simulator. Real-world audio and video media are obtained from external sensors such as microphones, cameras, and biometric sensors. Audio and video media from virtual simulators are interpreted directly via source code. System 2 identifies (e.g., by detecting specific facial expressions, by detecting specific vocal expressions, etc.) and removes emotional events (e.g., by removing and replacing video comprising specific facial expressions, by removing and replacing specific vocal expressions, etc.) from audio and video media. As a first example, video data illustrating smiling faces may be modified or replaced to show neutral, expressionless faces. As a second example, audio data that comprises emotion (e.g., happy or sad voices, excited voices, etc.) may be modified or replaced to sound monotone and expressionless. System 2 deconstructs real-time audio and video media and transforms it such that a viewer without a disability (e.g., a non-autistic person) may experience what a viewer with a disability (e.g., an autistic person) may perceive. The identification and removal of emotion from audio and video media may comprise a hybrid process that involves automatic and human editing. Alternatively, the identification and removal of emotion from audio and video media may comprise an automatic process. Additionally, system 2 may remove a focus on central objects (i.e., from video media). For example, an autistic person may have difficulty identifying a central or main object in a visual image and therefore system 2 may identify the central object in the visual image and modify the visual image such that it no longer appears as the central object. The central object may be identified as being the largest object in a display. An illustrative sketch of these transforms is provided after item 2 below.

2. Aiding a disabled viewer (e.g., an autistic person) to view modified (i.e., with removed emotions) audio/video media using audible or visual cues. System 2 may add the audible or visual cues to the modified audio/video media, thereby guiding a disabled person to help them notice moments of emotion and maintain focus. The emotional cues identified by system 2 are used to generate audible/visual cues for disabled users. For example, if a person in video data is identified as having a happy smiling face and/or speaking in a happy tone (i.e., after the emotions are removed), system 2 may generate a visual or audio cue (e.g., a red circle surrounding the person or a specified audible sound) at the moment of the emotional event. System 2 may be implemented in real-life settings through visual display, edited media (movies, television, radio, etc.), virtual simulations, etc. The sketch below illustrates both transforms.
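The following is a minimal, self-contained sketch of the transforms described in items 1 and 2 above. It is illustrative only: the data structures (DetectedFace, VisualObject, Cue), the idea of working from pre-computed detector output, and all function names are assumptions introduced here for explanation, not part of the claimed system, and no particular recognition library is implied. Emotion removal is modeled as relabeling detected expressions as neutral, the central object is taken to be the largest object in a frame, and each identified emotional event yields one guiding cue.

```python
from dataclasses import dataclass, replace
from typing import List

@dataclass(frozen=True)
class DetectedFace:        # hypothetical detector output for one face
    person_id: str
    expression: str        # e.g. "happy", "sad", "excited", "neutral"

@dataclass(frozen=True)
class VisualObject:        # hypothetical detector output for one object
    name: str
    width: int             # bounding-box size in pixels
    height: int

@dataclass(frozen=True)
class Cue:                 # guiding cue added for a disabled viewer
    timestamp_s: float
    description: str

def erase_emotion(faces: List[DetectedFace]) -> List[DetectedFace]:
    """Item 1: replace every detected expression with a neutral, expressionless one."""
    return [replace(f, expression="neutral") for f in faces]

def central_object(objects: List[VisualObject]) -> VisualObject:
    """Item 1: identify the central (main) object as the largest object in the frame."""
    return max(objects, key=lambda o: o.width * o.height)

def generate_cue(timestamp_s: float, face: DetectedFace) -> Cue:
    """Item 2: mark the moment of a removed emotional event with a visual cue."""
    return Cue(timestamp_s, f"red circle around {face.person_id} ({face.expression})")

if __name__ == "__main__":
    faces = [DetectedFace("speaker", "happy"), DetectedFace("guest", "sad")]
    print(erase_emotion(faces))              # both faces relabeled "neutral"
    scene = [VisualObject("house", 300, 200), VisualObject("airplane", 80, 40)]
    print(central_object(scene).name)        # -> "house" (largest object)
    print(generate_cue(12.5, faces[0]))      # cue at the removed happy moment
```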

System 2 of FIG. 1 comprises a computing apparatus 9, an original audio file/stream 5 a, an original video file/stream 5 b, and a simulated/enhanced audio and/or video file/stream 8 a connected to a computing system 10 through a network 7. Network 7 may comprise any type of network including, inter alia, a local area network (LAN), a wide area network (WAN), the Internet, etc. Original audio file/stream 5 a, original video file/stream 5 b, and simulated/enhanced audio and/or video file/stream 8 a may each be comprised by a computer, a database, a repository, etc. Computing apparatus 9 is used as an administrator for generating simulated/enhanced audio and/or video file/stream 8 a. Computing apparatus 9 may comprise, inter alia, a personal computer, a laptop computer, a computer terminal, a telephone, etc. Computing apparatus 9 may comprise a single computing apparatus or a plurality of computing apparatuses. Computing system 10 may comprise any type of computing system(s) including, inter alia, a personal computer (PC), a server computer, a database computer, etc. Computing system 10 may comprise a single computing system or a plurality of computing systems. Computing system 10 comprises a memory system 14. Memory system 14 may comprise a single memory system. Alternatively, memory system 14 may comprise a plurality of memory systems. Memory system 14 may be internal to computing system 10 (e.g., as illustrated in FIG. 1) or external to computing system 10. Memory system 14 comprises a simulation/enhancement software application 18 and repository 17. Repository 17 may be located internal to memory system 14 or external to memory system 14. Repository 17 may be a database. Repository 17 comprises audio/video data generated and/or synchronized by simulation/enhancement software application 18 (e.g., simulated/enhanced audio and/or video file/stream 8 a). Simulation/enhancement software application 18 may comprise a plurality of software modules (as described with respect to FIG. 3, infra) for simulating disabilities and/or enhancing audio/video data streams.

Simulation/enhancement software application 18 allows for the following steps to be performed:

1. Computing system 10 receives information (i.e., original audio file/stream 5 a, original video file/stream 5 b, and biometrics data).

2. Simulation/enhancement software application 18 transforms the information (i.e., original audio file/stream 5 a, original video file/stream 5 b, and biometrics data) to generate a simulation describing how a person with a disability (e.g., autism) perceives their surroundings.

3. The simulation is presented to a viewer via an audio/video monitor. The viewer may comprise a person without a disability.

4. The simulation or original input (i.e., original audio file/stream 5 a, original video file/stream 5 b, and biometrics data) may additionally be modified in order to generate visual and audio modifications that enable a user with a disability to understand the simulation or original input. For example, information (e.g., visual or audio cues) may be added to the simulation to enable a user with a disability to follow the flow of any given event. The user with the disability will be able to view or listen to the simulation with enhanced marks or labels that illustrate emotions (i.e., as described with reference to FIG. 5).

FIG. 2 illustrates an algorithm describing a process used by system 2 of FIG. 1 for simulating disabilities and/or enhancing audio/video data streams, in accordance with embodiments of the present invention. In step 202, a computing system (e.g., computing system 10 in FIG. 1) receives a first (original) audio file/stream (e.g., original audio stream/file 5 a in FIG. 1). The first audio file/stream includes first speech data associated with a first person. In step 204, the computing system receives a first video file/stream (e.g., original video stream/file 5 b in FIG. 1). The first video file/stream includes first video data associated with the first person. The first audio file/stream and the first video file/stream may be associated with, inter alia, a television show, a movie, a live event (e.g., a concert, a live speaker, etc.), etc. In step 208, a selection is made to:

1. Identify emotional attributes associated with the first audio file/stream and/or the first video file/stream; or

2. Locate main focus objects (i.e., locate and identify a central object in a visual image and modify the visual image such that the central object no longer appears as the central object, as described with respect to FIG. 5).

If in step 208, the selection is made to locate main focus objects then in step 228, the first video file/stream (and/or first audio file/stream) is monitored and the main focus object and a background focus object are located and identified. The main focus object may be identified by locating a largest object in an image and the background focus object may be identified by locating a smallest object in the image. In step 232, the image views (or audio levels) for the main focus object and the background focus object (e.g., a smallest object in the image, a low audio level) are modified (e.g., a size for the main focus object and a size for the background focus object may be reversed such that the main focus object appears smaller than the background focus object). In step 235, portions of the first audio file/stream are synchronized with associated portions of the first video file/stream (i.e., comprising the modified image views from step 232) and stored and/or broadcasted for a viewer.
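The following sketch illustrates steps 228 and 232 only, under the same illustrative assumptions as the earlier sketch (a hypothetical VisualObject record standing in for a detected object): the main focus object is taken to be the largest object in the image, the background focus object the smallest, and their sizes are reversed so that the main focus object no longer dominates the frame.

```python
from dataclasses import dataclass, replace
from typing import List, Tuple

@dataclass(frozen=True)
class VisualObject:            # hypothetical detector output
    name: str
    width: int
    height: int

def locate_focus_objects(objs: List[VisualObject]) -> Tuple[VisualObject, VisualObject]:
    """Step 228: main focus = largest object, background focus = smallest object."""
    main = max(objs, key=lambda o: o.width * o.height)
    background = min(objs, key=lambda o: o.width * o.height)
    return main, background

def swap_sizes(objs: List[VisualObject]) -> List[VisualObject]:
    """Step 232: reverse the sizes of the main and background focus objects."""
    main, background = locate_focus_objects(objs)
    swapped = []
    for o in objs:
        if o is main:
            swapped.append(replace(o, width=background.width, height=background.height))
        elif o is background:
            swapped.append(replace(o, width=main.width, height=main.height))
        else:
            swapped.append(o)
    return swapped

if __name__ == "__main__":
    scene = [VisualObject("airplane", 400, 150), VisualObject("house", 60, 50)]
    print(swap_sizes(scene))   # airplane shrinks to 60x50, house grows to 400x150
```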

If in step 208, the selection is made to identify emotional attributes associated with the first audio file/stream and/or the first video file/stream then in step 212, the first video file/stream and/or the first audio file/stream is monitored and emotional attributes associated with the first audio file/stream and/or the first video file/stream are identified (e.g., by detecting specific facial expressions, by detecting specific vocal expressions, etc.). In step 218, a second audio file/stream associated with said first audio file/stream is generated. The second audio file/stream comprises the first speech data associated with the first person and does not comprise the emotional attributes identified in step 212. In step 220, a second video file/stream associated with said first video file/stream is generated. The second video file/stream comprises the first video data associated with the first person and does not comprise the emotional attributes identified in step 212. In (optional) step 225, visual or audio labels are added to the second audio file/stream and/or the second video file/stream (to portions associated with the emotional attributes identified in step 212). Additionally, a third audio data stream associated with the second audio data file/stream may be generated. The third audio data stream includes the second audio data file/stream and a first audible portion of the second audio data file/stream that comprises an audible label. As an additional embodiment, portions of the third audio data stream may be synchronized with associated portions of the second video data file/stream and a first audio/video data stream associated with the third audio data stream and said second video data file/stream may be generated and broadcasted. Additionally, visual objects of video data (i.e., from step 220) may be associated with an attribute of the emotional attributes of step 212. A viewable label (indicating the attribute) for a first visual object (of the visual objects) may be generated and applied to the first visual object. In response, a third video data stream associated with the second video data file/stream may be generated. The third video data stream may include the second video data file/stream and the first visual object comprising the viewable label. In (optional) step 228, the second video file/stream (e.g., comprising the visual labels or not comprising the visual labels) and/or first audio file/stream are monitored and a main focus object and a background focus object (in the second video file/stream) are located and identified. The main focus object may be identified by locating a largest object in an image and the background focus object may be identified by locating a smallest object in the image. In (optional) step 232, the image views for the main focus object and the background focus object (e.g., a smallest object in the image) are modified (e.g., a size for the main focus object and a size for the background focus object may be reversed such that the main focus object appears smaller than the background focus object). In step 235, portions of the second audio file/stream are synchronized with associated portions of the second video file/stream (e.g., comprising visual or audio labels from step 225 and/or comprising the modified image views from step 232) and stored and/or broadcasted for a viewer.
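The following sketch illustrates only (optional) step 225 and step 235, again with purely hypothetical structures: a Segment record stands in for a time-aligned portion of a decoded stream, labels are attached to the portions where emotional attributes were identified and removed, and synchronization simply pairs audio and video portions that cover the same time span.

```python
from dataclasses import dataclass
from typing import List

@dataclass(frozen=True)
class Segment:                 # hypothetical time-aligned portion of a stream
    start_s: float
    end_s: float
    payload: str               # stands in for decoded audio or video data

def add_labels(segments: List[Segment], emotional_starts: List[float],
               label: str) -> List[Segment]:
    """Step 225: attach a visual/audible label to portions where emotion was removed."""
    return [Segment(s.start_s, s.end_s, f"{s.payload} [{label}]")
            if s.start_s in emotional_starts else s
            for s in segments]

def synchronize(audio: List[Segment], video: List[Segment]) -> List[tuple]:
    """Step 235: pair audio and video portions that cover the same time span."""
    video_by_start = {v.start_s: v for v in video}
    return [(a, video_by_start[a.start_s]) for a in audio
            if a.start_s in video_by_start]

if __name__ == "__main__":
    audio = [Segment(0.0, 5.0, "monotone speech"), Segment(5.0, 10.0, "monotone speech")]
    video = [Segment(0.0, 5.0, "neutral face"), Segment(5.0, 10.0, "neutral face")]
    labeled_video = add_labels(video, [5.0], "happy expression removed here")
    for pair in synchronize(audio, labeled_video):
        print(pair)
```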

FIG. 3 illustrates a first internal block diagram view of simulation/enhancement software application 18, in accordance with embodiments of the present invention.

Simulation/enhancement software application 18 comprises a plurality of software modules:

1. Audio transformer module 18 a.
2. Emotion identification module 18 b.
3. Video transformer module 18 c.
4. Emotion eraser module 18 d.
5. Focus/attention/transformer module 18 e.
6. Focus identification module 18 f.
7. Focus eraser module 18 g.
8. Integrator module 18 h.

Audio transformer module 18 a converts an original audio file/stream (e.g., removes emotion events, adds audio cues, etc. from original audio file/stream 5 a of FIG. 1) into simulated/enhanced audio and/or video streams/files 8 a (i.e., from FIG. 1). Video transformer module 18 c converts an original video file/stream (e.g., removes emotion events, adds video cues or labels, modifies image/focus views, etc. from original video file/stream 5 b of FIG. 1) into simulated/enhanced audio and/or video streams/files 8 a. Emotion identification module 18 b identifies the emotion events from original audio file/stream 5 a and original video file/stream 5 b using biometric recognition systems. Emotion eraser module 18 d removes the emotion events identified by emotion identification module 18 b. Focus identification module 18 f identifies a main focus object and a background focus object requiring modification so that a video object (or an audio object) that is intended to be a center of attention becomes equivalent to a background video object (or audio object). For example, a main character speaking in a crowd or a specific car traveling on a highway may be modified to blend into the background. The main focus object may be identified by locating a largest object in an image and the background focus object may be identified by locating a smallest object in the image. Focus/attention/transformer module 18 e uses focus identification data generated by focus identification module 18 f to identify and change a portion of the video object (or audio object) that presents focal elements or cues. Focus eraser module 18 g redistributes a size of the main focus objects (i.e., identified by focus identification module 18 f) such that all objects of visual image or audio data have equal value. Integrator module 18 h combines transformed audio (i.e., transformed by audio transformer module 18 a, emotion identification module 18 b, emotion eraser module 18 d, focus/attention/transformer module 18 e, focus identification module 18 f, focus eraser module 18 g, and integrator module 18 h) with transformed video (i.e., transformed by video transformer module 18 c, emotion identification module 18 b, emotion eraser module 18 d, focus/attention/transformer module 18 e, focus identification module 18 f, focus eraser module 18 g, and integrator module 18 h).
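A rough, purely illustrative sketch of how such modules might compose is shown below; the function names mirror the module roles described above but are assumptions introduced for explanation rather than the actual implementation, and plain strings stand in for real audio and video frames.

```python
from typing import List

# Hypothetical stand-ins for the module inputs/outputs; real media handling is out of scope.
AudioTrack = List[str]
VideoTrack = List[str]

def audio_transformer(track: AudioTrack) -> AudioTrack:
    """Rough analogue of modules 18 a/18 b/18 d: flatten emotional speech to monotone."""
    return [frame.replace("happy", "monotone").replace("sad", "monotone")
            for frame in track]

def video_transformer(track: VideoTrack) -> VideoTrack:
    """Rough analogue of modules 18 c/18 b/18 d: flatten expressions to neutral."""
    return [frame.replace("smiling", "neutral").replace("frowning", "neutral")
            for frame in track]

def integrator(audio: AudioTrack, video: VideoTrack) -> List[tuple]:
    """Rough analogue of module 18 h: combine the transformed tracks frame by frame."""
    return list(zip(audio, video))

if __name__ == "__main__":
    audio = ["happy voice", "sad voice"]
    video = ["smiling face", "frowning face"]
    print(integrator(audio_transformer(audio), video_transformer(video)))
```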

FIG. 4 is an implementation example illustrating the results of executing emotion identification module 18 b and emotion eraser module 18 d of FIG. 3, in accordance with embodiments of the present invention. Expression 400 illustrates a happy expression (i.e., a happy emotion). Expression 401 illustrates a sad expression (i.e., a sad emotion). Expression 402 illustrates a neutral expression (i.e., a neutral emotion). Expression 402 results from executing emotion eraser module 18 d of FIG. 3 on expression 400 or 401.

FIG. 5 is an implementation example illustrating the results of executing focus/attention/transformer module 18 e, focus identification module 18 f, and focus eraser module 18 g of FIG. 3, in accordance with embodiments of the present invention. FIG. 5 illustrates two examples for implementing focus/attention/transformer module 18 e, focus identification module 18 f, and focus eraser module 18 g of FIG. 3.

EXAMPLE 1

Example 1 illustrates an embodiment associated with a disabled (e.g., autistic) viewer. In example 1, an original image 400 comprises an airplane 404 a and a house 408 a. House 408 a is a video object that is a main object (i.e., intended focus). Airplane 404 a is a video object that is a background object. Modified image 401 (i.e., modified by focus/attention/transformer module 18 e, focus identification module 18 f, and focus eraser module 18 g of FIG. 3) comprises an airplane 404 b and a house 408 b. House 408 b has been enlarged in size (i.e., from house 408 a) to illustrate the main object (i.e., intended focus). Airplane 404 b has been reduced in size (i.e., from airplane 404 a) to illustrate the background object.

EXAMPLE 2

Example 2 illustrates an embodiment associated with a non-disabled viewer. Example 2 simulates (i.e., for a non-disabled viewer) an environment that a disabled person (e.g., autistic) perceives. In example 2, an original image 400 comprises an airplane 404 a and a house 408 a. House 408 a is a video object that is a background object. Airplane 404 a is a video object that is a main object (i.e., intended focus). Modified image 401 (i.e., modified by focus/attention/transformer module 18 e, focus identification module 18 f, and focus eraser module 18 g of FIG. 3) comprises an airplane 404 b and a house 408 b. House 408 b (i.e., the background object) has been enlarged in size (i.e., from house 408 a) and, likewise, airplane 404 b (i.e., the main object) has been reduced in size (i.e., from airplane 404 a) to illustrate how difficult it is for a disabled viewer to identify a main object. For example, an autistic person may have difficulty identifying a central or main object in a visual image.

FIG. 6 illustrates a second internal block diagram view of simulation/enhancement software application 18, in accordance with embodiments of the present invention. Simulation/enhancement software application 18 comprises a plurality of software modules:

1. Level labeler module 18 i.
2. Label editor module 18 j.
3. Emotion identification module 18 k.
4. Emotion enhancer module 18 l.
5. Focus identification module 18 m.
6. Enhancement editor module 18 n.
7. Editor module 18 o.

Audio and video objects for modification are identified and labeled by level labeler module 18 i. Labeling may comprise identifying categories of the audio and video objects, for example, identifying a face, a car, a musical instrument, etc. Label editor module 18 j is used to edit the labels generated by level labeler module 18 i. Emotion identification module 18 k performs a higher level labeling process. A higher level labeling process may include identifying emotion and focal elements in the audio and video objects. Label editor module 18 j may be used to edit the labels generated by emotion identification module 18 k. Focus identification module 18 m identifies areas of focus in audio and video objects. Editor module 18 o edits the areas of focus identified by focus identification module 18 m. Emotion enhancer module 18 l identifies a given emotional event with a visual cue on a display. For example, if a person on the display illustrates a happy emotion, their face may be encircled with a red perimeter. Enhancement editor module 18 n may be used to edit the emotional event identified by emotion enhancer module 18 l.
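A minimal illustrative sketch of this labeling hierarchy follows; the LabeledObject structure, the mutable labels dictionary, and the hard-coded example labels are assumptions introduced only to show how coarse category labels, higher-level emotion labels, enhancement cues, and manual edits could layer onto the same objects.

```python
from dataclasses import dataclass, field
from typing import Dict, List

@dataclass
class LabeledObject:           # hypothetical object flowing through the labeling modules
    name: str
    labels: Dict[str, str] = field(default_factory=dict)

def level_labeler(objects: List[LabeledObject]) -> None:
    """Module 18 i analogue: assign a coarse category label to each object."""
    for obj in objects:
        obj.labels["category"] = "face" if "face" in obj.name else "object"

def emotion_identifier(objects: List[LabeledObject]) -> None:
    """Module 18 k analogue: add a higher-level label for emotion/focal elements."""
    for obj in objects:
        if obj.labels.get("category") == "face":
            obj.labels["emotion"] = "happy"      # placeholder for real recognition

def emotion_enhancer(objects: List[LabeledObject]) -> None:
    """Module 18 l analogue: mark emotional objects with a visual cue."""
    for obj in objects:
        if "emotion" in obj.labels:
            obj.labels["cue"] = "red perimeter"

def label_editor(obj: LabeledObject, key: str, value: str) -> None:
    """Module 18 j/18 n analogue: let a human editor override a generated label."""
    obj.labels[key] = value

if __name__ == "__main__":
    scene = [LabeledObject("speaker face"), LabeledObject("guitar")]
    level_labeler(scene)
    emotion_identifier(scene)
    emotion_enhancer(scene)
    label_editor(scene[0], "emotion", "neutral")   # manual correction
    print(scene)
```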

FIG. 7 is an implementation example illustrating the results of executing emotion enhancer module 18 l of FIG. 6, in accordance with embodiments of the present invention. Expression 700 illustrates a happy expression (i.e., a happy emotion). Expression 701 illustrates a neutral expression (i.e., a neutral emotion) converted from expression 700. Expression 702 illustrates expression 701 comprising an identifier 702 a (i.e., a polygon perimeter) to help a disabled viewer recognize the happy expression removed from expression 700.

FIG. 8 illustrates a computer system 90 used for simulating disabilities and/or enhancing audio/video data streams, in accordance with embodiments of the present invention. The computer system 90 comprises a processor 91, an input device 92 coupled to the processor 91, an output device 93 coupled to the processor 91, and memory devices 94 and 95 each coupled to the processor 91. The input device 92 may be, inter alia, a keyboard, a mouse, etc. The output device 93 may be, inter alia, a printer, a plotter, a computer screen (e.g., monitor 110), a magnetic tape, a removable hard disk, a floppy disk, etc. The memory devices 94 and 95 may be, inter alia, a hard disk, a floppy disk, a magnetic tape, an optical storage such as a compact disc (CD) or a digital video disc (DVD), a dynamic random access memory (DRAM), a read-only memory (ROM), etc. The memory device 95 includes a computer code 97. The computer code 97 includes an algorithm for simulating disabilities and/or enhancing audio/video data streams (e.g., the algorithm of FIG. 2). The processor 91 executes the computer code 97. The memory device 94 includes input data 96. The input data 96 includes input required by the computer code 97. The output device 93 displays output from the computer code 97. Either or both memory devices 94 and 95 (or one or more additional memory devices not shown in FIG. 8) may comprise the algorithm of FIG. 2 and may be used as a computer usable medium (or a computer readable medium or a program storage device) having a computer readable program code embodied therein and/or having other data stored therein, wherein the computer readable program code comprises the computer code 97. Generally, a computer program product (or, alternatively, an article of manufacture) of the computer system 90 may comprise said computer usable medium (or said program storage device).

Still yet, any of the components of the present invention could be deployed, managed, serviced, etc. by a service provider who offers to simulate disabilities and/or enhance audio/video data streams. Thus the present invention discloses a process for deploying or integrating computing infrastructure, comprising integrating computer-readable code into the computer system 90, wherein the code in combination with the computer system 90 is capable of performing a method for simulating disabilities and/or enhancing audio/video data streams. In another embodiment, the invention provides a business method that performs the process steps of the invention on a subscription, advertising, and/or fee basis. That is, a service provider, such as a Solution Integrator, could offer to simulate disabilities and/or enhance audio/video data streams. In this case, the service provider can create, maintain, support, etc., a computer infrastructure that performs the process steps of the invention for one or more customers. In return, the service provider can receive payment from the customer(s) under a subscription and/or fee agreement and/or the service provider can receive payment from the sale of advertising content to one or more third parties.

While FIG. 8 shows the computer system 90 as a particular configuration of hardware and software, any configuration of hardware and software, as would be known to a person of ordinary skill in the art, may be utilized for the purposes stated supra in conjunction with the particular computer system 90 of FIG. 8. For example, the memory devices 94 and 95 may be portions of a single memory device rather than separate memory devices.

While embodiments of the present invention have been described herein for purposes of illustration, many modifications and changes will become apparent to those skilled in the art. Accordingly, the appended claims are intended to encompass all such modifications and changes as fall within the true spirit and scope of this invention.

1. A simulation method, comprising: receiving, by a computing system, a first video data stream, wherein said first video data stream comprises first video data associated with a first person; monitoring, by said computing system, said first video data stream; identifying, by said computing system in response to said monitoring said first video data stream, first emotional attributes comprised by said first video data; generating, by said computing system in response to said identifying said first emotional attributes, a second video data stream associated with said first video data stream, wherein said second video data stream comprises second video data associated with said first person, and wherein said second video data does not comprise said first emotional attributes; storing, by said computing system, said second video data stream; identifying, by said computing system, a first attribute of said first emotional attributes; associating, by said computing system, a first visual object of said second video data with said first attribute of said first emotional attributes; and generating, by said computing system, a viewable label for said first visual object, wherein said viewable label indicates said first attribute of said first emotional attributes; applying, by said computing system, said viewable label to said first visual object; generating, by said computing system in response to said applying, a third video data stream associated with said second video data stream, wherein said third video data stream comprises said second video data and said first visual object comprising said viewable label; and storing, by said computing system, said third video data stream.
2. The method of claim 1, further comprising: receiving, by said computing system, a first audio data stream, wherein said first audio data stream comprises first speech data associated with said first person; monitoring, by said computing system, said first audio data stream; identifying, by said computing system in response to said monitoring said first audio data stream, second emotional attributes comprised by said first audio data stream; generating, by said computing system in response to said identifying said second emotional attributes, a second audio data stream associated with said first audio data stream, wherein said second audio data stream comprises said first speech data, and wherein said second audio data stream does not comprise said second emotional attributes; and storing, by said computing system, said second audio data stream.
3. The method of claim 2, further comprising: identifying, by said computing system, a first attribute of said second emotional attributes; associating, by said computing system, a first audible portion of said second audio data stream with said first attribute of said second emotional attributes; and generating, by said computing system, an audible label for said first audible portion of said second audio data stream, wherein said audible label indicates said first attribute of said second emotional attributes; applying, by said computing system, said audible label to said first audible portion of said second audio data stream; generating, by said computing system in response to said applying, a third audio data stream associated with said second audio data stream, wherein said third audio data stream comprises said second audio data stream and said first audible portion of said second audio data stream comprising said audible label; and storing, by said computing system, said third audio data stream.
4. The method of claim 3, further comprising: synchronizing, by said computing system, portions of said third audio data stream with associated portions of said second video data stream; generating, by said computing system in response to said synchronizing, a first audio/video data stream associated with said third audio data stream and said second video data stream; and broadcasting, by said computing system, said first audio/video data stream.
5. The method of claim 2, further comprising: synchronizing, by said computing system, portions of said second audio data stream with associated portions of said second video data stream; generating, by said computing system in response to said synchronizing, a first audio/video data stream associated with said second audio data stream and said second video data stream; and broadcasting, by said computing system, said first audio/video data stream.
6. The method of claim 1, wherein said viewable label comprises a label selected from the group consisting of a polygon label comprising a perimeter for surrounding said first visual object and a label comprising a different color applied to said first visual object.
7. The method of claim 1, wherein said first emotional attributes comprise facial expressions of said first person.
8. The method of claim 1, further comprising: enabling, by said computing system, said second video data stream; monitoring, by said computing system, said second video data stream; identifying, by said computing system in response to said monitoring said second video data stream, a first visual image of said second video data stream, wherein said first visual image comprises a first visual object and a second visual object, wherein said first visual object is an intended main viewing object of said first visual image, and wherein said second visual object is an intended background viewing object of said first visual image; generating, by said computing system from said first visual image, a second visual image, wherein said second visual image comprises said first visual object and said second visual object, wherein said first visual object comprised by said second visual image is an intended background viewing object of said second visual image, and wherein said second visual object comprised by said second visual image is an intended main viewing object of said second visual image; generating, by said computing system, a third video data stream, wherein said generating said third video data stream comprises removing said first visual image from said second video data stream and placing said second visual image in said second video data stream; and storing, by said computing system, said third video data stream.
9. The method of claim 8, further comprising: generating, by said computing system, a viewable label for said second visual object comprised by said second visual image, wherein said viewable label indicates that said second visual object comprised by said second visual image is an intended main viewing object of said second visual image; applying, by said computing system, said viewable label to said second visual object comprised by said second visual image; generating, by said computing system in response to said applying, a fourth video data stream associated with said third video data stream, wherein said fourth video data stream comprises said third video data stream and said second visual object comprising said viewable label; and storing, by said computing system, said fourth video data stream.
10. A process for supporting computer infrastructure, said process comprising providing at least one support service for at least one of creating, integrating, hosting, maintaining, and deploying computer-readable code in a computing system, wherein the code in combination with the computing system is capable of performing the method of claim 1.
11. A computer program product, comprising a computer storage device storing a computer readable program code, said computer readable program code configured to perform a method upon being executed by a processor of a computing system, said method comprising: receiving, by said computing system, a first video data stream, wherein said first video data stream comprises first video data associated with a first person; monitoring, by said computing system, said first video data stream; identifying, by said computing system in response to said monitoring said first video data stream, first emotional attributes comprised by said first video data; generating, by said computing system in response to said identifying said first emotional attributes, a second video data stream associated with said first video data stream, wherein said second video data stream comprises second video data associated with said first person, and wherein said second video data does not comprise said first emotional attributes; storing, by said computing system, said second video data stream; identifying, by said computing system, a first attribute of said first emotional attributes; associating, by said computing system, a first visual object of said second video data with said first attribute of said first emotional attributes; and generating, by said computing system, a viewable label for said first visual object, wherein said viewable label indicates said first attribute of said first emotional attributes; applying, by said computing system, said viewable label to said first visual object; generating, by said computing system in response to said applying, a third video data stream associated with said second video data stream, wherein said third video data stream comprises said second video data and said first visual object comprising said viewable label; and storing, by said computing system, said third video data stream.
12. A computing system comprising a processor coupled to a computer-readable memory unit, said memory unit comprising a computer readable code configured to be executed by the processor to perform a method comprising: receiving, by said computing system, a first video data stream, wherein said first video data stream comprises first video data associated with a first person; monitoring, by said computing system, said first video data stream; identifying, by said computing system in response to said monitoring said first video data stream, first emotional attributes comprised by said first video data; generating, by said computing system in response to said identifying said first emotional attributes, a second video data stream associated with said first video data stream, wherein said second video data stream comprises second video data associated with said first person, and wherein said second video data does not comprise said first emotional attributes; storing, by said computing system, said second video data stream; identifying, by said computing system, a first attribute of said first emotional attributes; associating, by said computing system, a first visual object of said second video data with said first attribute of said first emotional attributes; and generating, by said computing system, a viewable label for said first visual object, wherein said viewable label indicates said first attribute of said first emotional attributes; applying, by said computing system, said viewable label to said first visual object; generating, by said computing system in response to said applying, a third video data stream associated with said second video data stream, wherein said third video data stream comprises said second video data and said first visual object comprising said viewable label; and storing, by said computing system, said third video data stream.